How Famous is a Scientist? 



— Famous to Those Who Know Us. 
Following a recent idea to measure fame by the number of Google hits found in a search on the
WWW, we study the relation between fame (Google hits) and merit (number of papers posted on
an electronic archive) for a random group of scientists in condensed matter and statistical physics.
Our findings show that fame and merit in science are linearly related, and that the probability
distribution for a certain level of fame falls off exponentially. This is in sharp contrast with the
original findings about WW I ace pilots, for which fame is exponentially related to merit (number
of downed planes), and the probability of fame decays in power-law fashion. Other groups in our
study show patterns of fame similar to those of ace pilots.

The concept of fame is socially and economically important to many people and to the organizations to which they belong. However, it is not a well-defined concept, since each person has their own idea of what it means to be famous, including perhaps: being recognizable to the "common" person on the street (but how do we define the common person?), being on television, or appearing frequently in newspapers and other media. Recently, researchers at UCLA, Simkin and Roychowdhury [1], performed an empirical study in which they catalogued the fame of World War I pilot "Aces." For reasons of specificity in measurement, they chose an interesting definition of fame: the number of hits a search for a person's name garners in the Google search engine. In this view, our fame is taken to be how well linked we are in what has quickly become a most popular medium, the World Wide Web, and is related to the number of webpages that mention us, as measured by the PageRank system that is behind Google's popular search engine. This is an ingenious idea, in that it provides an inexpensive measurement of social impact by enlisting powerful computer resources, freely available to all, to perform what would otherwise be an expensive social study. However, we mention that in practice even this measurement is hard to make completely precise without a great deal of effort, owing to the difficulty of separating coincidences in popular names (doubling and tripling a person's fame), and to the possibility of missing a person's full fame through too restrictive a search. Nonetheless, in this study it is precisely this Google-hits measure that we adapt, as carefully specified below.

The purpose of this communication is to explore whether the relation between fame and achievement (merit) in science differs from that found for ace pilots. In the UCLA study of fighter pilots [1], a pilot's achievement was measured by how many enemy planes the pilot had downed; was he an ace? It was found that fame increases exponentially with achievement, while the distribution of fame falls off algebraically, nearly as $(\mathrm{fame})^{-2}$. A model mechanism behind these findings was presented, describing the social context of fame within a random graph. We have catalogued similar measurements of fame and achievement for scientists working in the area of condensed matter or statistical physics. We find dramatically different behavior, with fame increasing linearly with achievement, and its probability falling off exponentially. A simple argument to account for these facts suggests that, in contrast to ace pilots, who enjoy public renown, scientists are well known mostly within their own community and do not reach true fame. Preliminary studies of other groups of people reveal fame patterns similar to those of ace pilots.

We begin with a brief summary of the findings in [1]. The distribution of fame $F$ (as measured by Google hits) falls off roughly as an inverse square,

$$P(F) \sim F^{-\gamma}, \qquad \gamma \approx 2, \tag{1}$$

while fame itself rises exponentially with achievement $A$ (number of downed planes),

$$F(A) \sim e^{\beta A}. \tag{2}$$

The two relations imply that achievement ought to be exponentially distributed:

$$P(A) \sim e^{-\alpha A}, \qquad \alpha = \beta(\gamma - 1) \approx \beta. \tag{3}$$

This is indeed confirmed from independent measurements.
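
The step from Eqs. (1) and (2) to Eq. (3) is a change of variables. A brief sketch, assuming that $F$ is a monotonic function of $A$ so that probability is conserved under the mapping (prefactors are dropped throughout):

```latex
\begin{align*}
P(A)\,dA &= P(F)\,dF, \\
P(A) &= P\bigl(F(A)\bigr)\,\frac{dF}{dA}
      \sim \bigl(e^{\beta A}\bigr)^{-\gamma}\,\beta\,e^{\beta A}
      \sim e^{-\beta(\gamma - 1)A}.
\end{align*}
```

Comparing with $P(A) \sim e^{-\alpha A}$ gives $\alpha = \beta(\gamma - 1)$, which reduces to $\alpha \approx \beta$ when $\gamma \approx 2$.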

Our findings for scientists (researchers in the area of condensed matter and statistical physics) are dramatically different. We have examined a list of 449 researchers, drawn randomly from among those who post articles on the web-based electronic board http://www.arxiv.org/archive/cond-mat. As a measure of fame, we used the UCLA Google-hit criterion, with search lexicon: "Author's name" AND "condensed matter" OR "statistical physics" OR "statistical mechanics". The distribution of fame, as measured by the number of hits, decays exponentially, rather than in power-law fashion (Fig. 1):

$$P(F) \sim e^{-\eta F}, \qquad \eta = 0.00102 \pm 0.00006. \tag{4}$$



The figure, together with the computed correlation for the two possibilities ($R^2 = 0.977$ for the exponential, vs. 0.82 for a power-law), leaves little doubt as to which is the better fit.

FIG. 1: Distribution of fame in science. The distribution is clearly better fitted by an exponential, $P(F) \sim 0.26\,e^{-0.001F}$ (curve, $R^2 = 0.98$), than by a power-law (straight line, $R^2 = 0.82$).

FIG. 2: Fame of individual scientists (Google hits) versus achievement (number of papers posted in the /cond-mat e-archive). The data (o) is better fitted by an almost linear relation, $F \sim cA$ (straight line), than by the exponential dependence found in [1].
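
The comparison between an exponential and a power-law fit of $P(F)$ can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data are synthetic (a hypothetical histogram generated from the shape quoted in Fig. 1, with added noise), both fits are done by linear least squares on $\log P$, and the text does not say how the quoted $R^2$ values were actually computed. The names `compare_fits` and `synthetic_fame_histogram` are ours.

```python
import numpy as np

def r_squared(y, y_fit):
    """Coefficient of determination of a fit y_fit to data y."""
    ss_res = np.sum((y - y_fit) ** 2)
    ss_tot = np.sum((y - np.mean(y)) ** 2)
    return 1.0 - ss_res / ss_tot

def compare_fits(f, p):
    """Fit p(f) by an exponential and by a power law.

    Both candidates are straight lines in transformed coordinates:
      exponential: log p = a - eta * f
      power law:   log p = b - gamma * log f
    Returns (r2_exponential, r2_power_law, eta, gamma).
    """
    f = np.asarray(f, dtype=float)
    log_p = np.log(np.asarray(p, dtype=float))

    # Exponential candidate: straight line of log p against f.
    slope_exp, intercept_exp = np.polyfit(f, log_p, 1)
    r2_exp = r_squared(log_p, slope_exp * f + intercept_exp)

    # Power-law candidate: straight line of log p against log f.
    log_f = np.log(f)
    slope_pow, intercept_pow = np.polyfit(log_f, log_p, 1)
    r2_pow = r_squared(log_p, slope_pow * log_f + intercept_pow)

    return r2_exp, r2_pow, -slope_exp, -slope_pow

def synthetic_fame_histogram(seed=0):
    """Hypothetical binned data: exponential decay with multiplicative noise."""
    rng = np.random.default_rng(seed)
    f = np.arange(250.0, 5000.0, 500.0)      # assumed bin centres (Google hits)
    p = 0.26 * np.exp(-0.001 * f)            # shape quoted in the Fig. 1 caption
    return f, p * np.exp(rng.normal(0.0, 0.2, size=f.size))

if __name__ == "__main__":
    f, p = synthetic_fame_histogram()
    r2_exp, r2_pow, eta, gamma = compare_fits(f, p)
    print(f"exponential: R^2 = {r2_exp:.3f}, eta = {eta:.5f}")
    print(f"power law:   R^2 = {r2_pow:.3f}, gamma = {gamma:.2f}")
```

Fitting in log space keeps both candidates linear, so the two $R^2$ values are directly comparable. A nonlinear fit on the raw histogram would weight the bins differently and could give somewhat different numbers.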

Next, we consider the relation between fame and achievement in science. As a simple-minded measure of achievement we take the total number of publications by an author on the cond-mat board, perhaps thus paying too much heed to the popular dictum "publish or perish." (The cond-mat board has been active since 1991.) Our findings are summarized in Fig. 2. The wide scatter of data points resembles that found for ace pilots by the UCLA team, but our best fit indicates a power-law (almost linear) increase of fame with achievement, rather than the exponential dependence found there. Indeed, we find $R^2 = 0.513$ for a power-law fit, vs. 0.328 for an exponential: quite the reverse of [1], which (for 393 ace pilots) cites $R^2 = 0.72$ for the exponential, vs. 0.48 for the linear fit. Our findings suggest the linear relation:

$$F(A) \sim cA^{\xi}, \qquad \xi = 0.97 \pm 0.04 \approx 1. \tag{5}$$

If Eqs. (4) and (5) are right (with $\xi = 1$), it follows that achievement in science is distributed exponentially:

$$P(A) \sim e^{-\nu A}, \qquad \nu = c\eta. \tag{6}$$

FIG. 3: Distribution of achievement in science, as measured by the number of papers posted by each author on the web archive http://arxiv.org/archive/cond-mat. The data (o) is better fitted by an exponential, $P(A) = 0.25\,e^{-0.031A}$ (curve, $R^2 = 0.90$), than by a power-law (straight line, $R^2 = 0.76$).

Indeed, independent measurements of the probability distribution of achievement do support this prediction (Fig. 3). Moreover, the best-fit value $\nu = 0.031 \pm 0.004$ is consistent with $c = 25 \pm 1$, $\eta = 0.00102 \pm 0.00006$, and $c\eta = 0.0255 \pm 0.0025$ found from the two previous plots. We note that the observed exponential decay of the probability of achievement in science is the only feature that seems to be shared with the fame-achievement question in the case of ace pilots.
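
The arithmetic behind this consistency check can be reproduced directly. The sketch below assumes that the uncertainty on the product $c\eta$ is obtained by adding the relative errors of $c$ and $\eta$ linearly; the text does not state the method, but this choice reproduces the quoted $\pm 0.0025$. Treating two values as consistent when their error bars overlap is likewise our assumption, and the function names are ours.

```python
def product_with_error(c, dc, eta, deta):
    """Return (c * eta, uncertainty), adding relative errors linearly."""
    value = c * eta
    error = value * (dc / c + deta / eta)
    return value, error

def error_bars_overlap(a, da, b, db):
    """True if the intervals a +/- da and b +/- db overlap."""
    return abs(a - b) <= da + db

if __name__ == "__main__":
    # Values quoted in the text: c from Fig. 2, eta from Eq. (4), nu from Fig. 3.
    nu_pred, nu_pred_err = product_with_error(25.0, 1.0, 0.00102, 0.00006)
    print(f"c * eta = {nu_pred:.4f} +/- {nu_pred_err:.4f}")
    print("consistent with nu = 0.031 +/- 0.004:",
          error_bars_overlap(0.031, 0.004, nu_pred, nu_pred_err))
```

Adding the relative errors in quadrature instead would give a smaller uncertainty on $c\eta$, so the linear rule is the more conservative of the two.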

What could be the reason for the different fame vs. merit patterns found for scientists and ace pilots? One likely difference is in the set of people who author webpages that refer to ace pilots, as opposed to those who write webpages about scientists. In [1] the authors explained their results using a "rich-get-richer" scheme [3], whereby individuals who are already popular attract people to generate new webpages at a rate proportional to their current popularity. Implicit in this mechanism is the assumption that there is an inexhaustible (or at least very large) pool of people who may author webpages on a popular subject. Such an assumption might perhaps be justified for a subject that enjoys wide notoriety within the public at large; in other words, for truly famous subjects. We maintain that, for whatever reason, scientists are simply not known to the general public, which curtails the option of rich-get-richer growth. Instead, it is mostly scientists who write webpages about other scientists in their own discipline. In fact, a simple explanation for the linear increase of fame (number of Google hits) with achievement (number of papers published) in science is that scientists get cited on other scientists' webpages in relation to their published work. If each published work typically generates citations in $c$ webpages, it follows that $A$ publications would connect a scientist with $F = cA$ webpages, on average.
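
This citation mechanism is easy to explore in a toy simulation. The sketch below is not the authors' model. It assumes, purely for illustration, that paper counts follow a geometric distribution (the discrete analogue of $P(A) \sim e^{-\nu A}$) and that each paper independently attracts a Poisson-distributed number of citing webpages with mean $c$. The function name and default parameters are hypothetical, with $c$ and $\nu$ set to the values quoted in the text.

```python
import numpy as np

def simulate_fame(n_scientists=20000, c=25.0, nu=0.0255, seed=0):
    """Toy model: fame accrues only through webpages citing one's papers.

    Assumptions (not taken from the data): paper counts A are geometric,
    the discrete analogue of P(A) ~ exp(-nu * A), and each paper
    independently attracts a Poisson number of citing pages with mean c.
    Returns (papers, hits) as integer arrays.
    """
    rng = np.random.default_rng(seed)
    papers = rng.geometric(1.0 - np.exp(-nu), size=n_scientists)  # A >= 1
    hits = rng.poisson(c * papers)                                # F given A
    return papers, hits

if __name__ == "__main__":
    papers, hits = simulate_fame()
    # Pages per paper should recover c, i.e. F = c * A on average.
    print("hits per paper:", hits.sum() / papers.sum())
    # For a near-exponential distribution the decay rate is roughly 1 / mean.
    print("estimated decay rate of P(F):", 1.0 / hits.mean())
    print("nu / c:", 0.0255 / 25.0)
```

In this toy model the fame distribution inherits the exponential form of the achievement distribution, with its decay rate reduced by the factor $c$. The estimated rate matches $\nu/c$ only approximately, because paper counts are discrete and start at one.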

It is also worth noting that it is not merely the relation between fame and merit that is different for scientists and ace pilots, but also the distribution of fame itself. This gives us a valuable, simpler way to explore fame among various groups of people, for while achievement is not usually easy to define, one can always use the Google-hit number as a universal measure of fame. We expect that truly famous groups would exhibit a power-law distribution of fame, as for ace pilots, while those that are famous only within their own community might display a fast-decaying or exponential distribution, as for scientists. We have tested this idea against a group of 291 Olympic runners (all distances) and a group of 263 television and movie actors (randomly drawn from an alphabetical list). The data, presented in Figs. 4 and 5, show that in both cases the probability of a certain level of fame drops off as a power-law, similar to Eq. (1), implying that "true" fame has been achieved: fame within the public at large. It is interesting, though, that the exponent $\gamma \approx 1.55$ found for runners is quite different from $\gamma = 2$ predicted by the theory advanced for ace pilots [1].

FIG. 4: Fame distribution for 291 athletes in various events: 100 m, 200 m, 1,500 m, and 10,000 m running, and 10,000 m walking. The data (o) is well fitted by a power-law (straight line) with exponent 1.55.

FIG. 5: Fame distribution among 263 movie and television actors. The data (o) is well fitted by a power-law (straight line) with exponent 2.03 ± 0.14, in line with the prediction in [1].

The fame-merit behavior found for ace pilots is well documented in at least one other case, independently of the Google-hit measure of fame. Consider a network of social contacts, where two individuals (nodes) are connected if they know each other on a first-name basis. In this case, the number of links $k$ coming out of a node constitutes an obvious measure of fame, $F = k$. As a measure of achievement, we may take the distance of a node from an individual at the top of the social ladder (the president of the USA, say). A node is at distance $l$ from the president if the shortest path connecting the two, along the links of the net, consists of $l$ links. Thus, people immediately connected to the president boast $l = 1$, while those connected to them are (at least) distance $l = 2$ away, etc. We shall deem being closer to the president to represent a higher level of social achievement, that is, $A = -l$. It is well known that social nets of contacts are scale-free, possessing a degree distribution $P(k) \sim k^{-\gamma}$. Hence,

$$P(F) \sim F^{-\gamma},$$

as in Eq. (1). Meanwhile, since the number of nodes at distance $l$ increases as $\langle k \rangle^{l}$ (where $\langle k \rangle$ is the average degree of a node), we have

$$P(A) \sim e^{-\alpha A}, \qquad \alpha = \ln\langle k \rangle.$$

The two relations imply

$$F(A) \sim e^{\beta A}, \qquad \beta = \alpha/(\gamma - 1),$$

exactly as found for ace pilots. Note, however, that in this example the fame exponent $\gamma$ need not equal 2. On the other hand, we know from recent studies on network growth that some kind of rich-get-richer mechanism is invariably involved, whether implicitly or explicitly.

To summarize, the relations for fame and merit found for ace pilots are probably universal among groups of people who enjoy true fame within the general public. Behind these relations there is a possible rich-get-richer mechanism for the acquisition of fame, though the details might vary from one group to the next. Most importantly, the nature of fame can be studied on its own with the cheap and easy measure of the number of Google hits: power-law distributions indicate true fame.

For scientists, we find completely different relations, including an exponentially decaying distribution of fame. This suggests that scientists (at least as measured by Google hits) are primarily famous within their own science community, and do not enjoy public fame as other groups in the study do. Needless to say, our study excluded the limited number of truly famous scientists, such as Albert Einstein (1,660,000 hits), Isaac Newton (902,000), Galileo Galilei (245,000), Richard Feynman (112,000), and perhaps a handful of others. Even their popularity pales in comparison with that of other public figures, such as the performers Michael Jackson (5,570,000), Janet Jackson (3,190,000), and Barry Williams (2,400,000).

Our findings for scientists might of course be affected by our choice of the Google-hit measure. Had we measured fame in any other way, we might have found the telltale power-law distribution characteristic of other groups. A reasonable argument is that scientists make heavy use of the WWW for propagation of their work, and as a result the number of webpages dictated by their professional activities overwhelms the number of webpages written about them by the general public. In this respect, there is comfort in our poor showing of fame, for at the same time it is an indication of a productive level of research.